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Given some observable H on a finite-dimensional quantum system, we investigate the typical properties of 
random state vectors \tp) that have a fixed expectation value = E with respect to H. Under some 

conditions on the spectrum, we prove that this manifold of quantum states shows a concentration of measure 
phenomenon: any continuous function on this set is almost everywhere close to its mean. We also give a method 
to estimate the corresponding expectation values analytically, and we prove a formula for the typical reduced 
density matrix in the case that is a sum of local observables. We discuss the implications of our results as new 
proof tools in quantum information theory and to study phenomena in quantum statistical mechanics. As a by- 
product, we derive a method to sample the resulting distribution numerically, which generalizes the well-known 
Gaussian method to draw random states from the sphere. 



I. INTRODUCTION 

The term concentration of measure phenomenon refers to 
the observation that in many high-dimensional spaces "contin- 
uous functions are almost everywhere close to their mean". A 
well-known illustration is the fact that on a high-dimensional 
sphere "most points lie close to the equator". In other words, 
the values of the coordinate functions concentrate about 0, 
their mean. On the sphere, the effect exists not only for co- 
ordinate functions, but for any Lipschitz-continuous function. 
The result — known as Levy's Lemma — has surprisingly many 
applications in both mathematics and physics (see below). 

Our main contribution is a "Levy's Lemma"-type concen- 
tration of measure theorem for the set of quantum states with 
fixed expectation value. 

More concretely, suppose that we are given any observable 
H = W with eigenvalues {Ek}^^i on C". In the following, 
we will often call H a "Hamiltonian" and Ek the "energy lev- 
els", but this is not the only possible physical interpretation. 
We fix some arbitrary value E, and we are interested in the set 
of pure quantum states with fixed expectation value E, i.e.. 

Me := e C" I {^p\H\^/J) = E and jlV^jl = 1} . 

Our main Theorem [1] shows that, subject to conditions on the 
spectrum of H, any continuous function on Me concentrates 
about its mean. 

The motivation for the approach taken here is two-fold. 



A. Motivation 1: The probabilistic method 

Beyond being a geometric curiosity, the concentration of 
measure effect is a crucial ingredient to an extremely versa- 
tile proof technique: the probabilistic method Recall the 



basic idea. Assume, by way of example, one wants to ascer- 
tain the existence of a state vector t/i on n qubits, such that ijj is 
"highly entangled" with respect to any bipartition of the n sys- 
tems into two sets. The problem seems daunting: There are 
exponentially many ways of dividing the composite system 
into two parts. For any bipartition, we need to make a state- 
ment about the entropy of the eigenvalue distribution of the 
reduced density matrix - a highly non-trivial function. Lastly, 
in any natural parametrization of the set of state vectors, a 
change of any of the parameters will affect the vast majority 
of the constraints simultaneously. 

Given these difficulties, it is an amazing fact that the proba- 
bilistic method reduces the problem above to a simple lemma 
with a schematic proof (detailed, e.g., in Ref. QBl)- Neither 
the non-trivial nature of the entropy function, nor the details 
of the tensor product space from which the vectors are drawn 
enters the proof. Only extremely coarse information - the Lip- 
schitz constant of the entropy and the concentration properties 
of the unit sphere - are needed. 

Consequently, proofs based on concentration properties are 
now common specifically in quantum information theory. Ex- 
amples include the investigation of "generic entanglement" 
Htl, random coding arguments to assess optimum rates in 
quantum communication protocols, state merging jsd, the cel- 
ebrated counterexample to the additivity conjecture in quan- 
tum information theory H], or the resource character of quan- 
tum states for measurement-based computing 

The tremendous reduction of complexity afforded by the 
probabilistic method motivates our desire to prove measure 
concentration for other naturally occurring spaces, besides the 
sphere. For the set of "states under a constraint". Theorem 1 
achieves that goal and opens up the possibility of applying 
randomized arguments in this setting. 



B. Motivation 2: Statistical mechanics 



'Electronic address: 'mueller@math. tu-berlin.de| 
^Electronic address: david. gross @ it p.uni-hannover.de | 
■f Electronic address: jense@qipc.org| 



The second motivation draws from notions of quantum sta- 
tistical mechanics jgl- fisll . The predictions of statistical me- 
chanics are based on ensemble averages, yet in practice prove 
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to apply already to single instances of thermodynamical sys- 
tems. This phenomenon needs to be explained. It becomes 
at least plausible if there is a measure concentration effect on 
the ensemble under consideration. Concentration implies that 
any observable will give values close to the ensemble mean 
for almost every state in the ensemble. This will in particular 
happen at almost every point on a sufficiently generic trajec- 
tory through the ensemble. Thus there may be an "apparent 
relaxation" ifiol [TTl [TsUlsIl even in systems not in a global 
equilibrium state. 

Recently, several authors realized that it is particularly sim- 
ple to state a precise quantitative version of this intuition for 
ensembles consisting of random vectors drawn from some 
subspace ifiol [isll . However, in the context of statistical me- 
chanics, it may be more natural to consider sets of states with 
prescribed energy expectation value, rather than elements of 
some linear subspace. Indeed, such "mean energy ensembles " 
have been studied before lfl9l422ll . Thus, it is natural to ask 
whether the concentration results for linear spaces translate to 
mean energy ensembles. 

We present both positive and negative results on this prob- 
lem. Since the mean energy ensemble and its properties de- 
pend on the spectrum of the chosen observable H, so does the 
degree of measure concentration. For many spectra typically 
encountered in large many-body systems, our main theorem 
yields trivial bounds. As explained in Section|IIIl this is partly 
a consequence of the fact that "Levy's Lemma-type" expo- 
nential concentration simply does not exist for such systems. 
However, for families of Hamiltonians with, for example, con- 
stant spectral radius, we do get meaningful concentration in- 
equalities. Therefore, the methods presented in this paper are 
expected to have a range of applicability complementary to 
other approaches. 

The question whether weakened concentration properties 
can be proven for more general many-body systems under 
energy constraints remains an interesting problem (see Sec- 
tionHID. 



II. MAIN RESULTS AND OVERVIEW 

As stated above, we will analyze the set 

Me {IV') G C" I (V'l-fflV') = E and ||?M| = 1} 



to their mean on almost all points of Me- In some sense, 
almost all quantum states with fixed expectation value be- 
have "typically" . To understand the theorem, note that Me 
is invariant with respect to energy shifts of the form E' 
E + s, H' H + si, such that the new eigenvalues are 
E'f^ := Ek + s. We then have M'^, = Me, i.e., the man- 
ifold of states does not change (only its description does). 
We call a function / : Me K X-Lipschitz if it satisfies 
\f{x) — f{y)\ < A||a; — ?/||, where || • j| denotes the Euclidean 
norm in C" . 

Theorem 1 (Concentration of measure). Let H = be 

any observable on C", with eigenvalues {EkY^^i, Emin '■= 
min^. El;, E,„ax '■— max;; Ei;, and arithmetic mean E^ '■= 
i Ek- Let E > Emin be any value which is not too close 
to the arithmetic mean, i.e.. 



E<Ea 



Suppose we draw a normalized state vector lip) S C" ran- 
domly under the constraint {tj}\H\^) = E, i.e., G Me is 
a random state according to the natural distribution described 
above. Then, if f : Me — > K is any X-Lipschitz function, we 
have 

Prob { ~f\>Xt]<a- nle-™(*^a^)'+2ev^, (1) 

where f is the median of f on Me, and the constants a, c and 
e can be determined in the following way: 

• Shift the energies by some offset s (as described above) 
such that E':„ > and 



E' 



1 

1 + - 

n 



l + ^\E' 



(2) 



with e > 0, where E'^ denotes the harmonic mean en- 
ergy. The offset may be chosen arbitrarily subject only 
to the constraint that the constant a below is positive. 

• Compute c = 3£'/,„„/ (32£") and 



E'n 



for some observable H and expectation value E. The set of all 
pure quantum states is a complex sphere in C"; equivalently, 
we can view it as the unit sphere 5^"^^ in R^". The obvi- 
ous geometric volume measure on S*^""^ corresponds to the 
unitarily invariant measure on the pure quantum states istl- As 
we will see below, the set Me is a submanifold of the sphere 
(and thus of R^"); hence it carries a natural volume measure 
as well, namely the "Hausdorff measure" lEsIl that it inherits 
from the surrounding Euclidean space R^". Normalizing it, 
we get a natural probability measure on Me- 

Our first main theorem can be understood as an analog of 
Levy's Lemma ||24|| for the manifold Me- It says that the 
measure on Me is strongly concentrated, in the sense that 
the values of Lipschitz-continuous functions are very close 



"iMQE't 



e^E' 



The theorem involves an energy offset s, shifting all energy 
levels to E'^. := Ek + s. The idea is to choose this shift such 
that E' ss E'u, i.e. such that the energy in question becomes 
close to the harmonic mean energy (we show in Lemma |20l 
below that this is always possible). Specifically, the theorem 
demands that E' becomes a bit larger than E'j^, resulting in a 
constant e > defined in eq. (|2]i. The theorem does not spec- 
ify s uniquely - there is some freedom for optimizing over 
the different possible choices of s. However, there is the con- 
straint that a > 0, which prevents us from choosing too small 
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values of e (indeed, a > is equivalent to e > E' /E'q). On 
the other hand, e should not be too large, because it appears in 
the exponent in eq. ([T]i. 

To apply the theorem, it is often useful to know the value of 
the median /. Our second main theorem gives an approxima- 
tion of / in the limit n oo: 

Theorem 2 (Estimation of the median /). With the notation 
from Theorem\l\ let N be the full ellipsoid 



{z\H'\z)<E'\l + ^^ 



and let f : N ^ R be any -Lipschitz function. Then, the 
median f of f on the energy manifold Me satisfies 



/-Eat/ < A 



where O { n 2 




/y/n + ln(2an^] /{2n). 



We proceed by discussing a simple example. Suppose we 
have a bipartite Hilbert space A(E} B with dimensions = 3 
and large, but arbitrary \B\, and the Hamiltonian 



1 



H = 



1b =■■ Ha «) li 



(3) 



We fix the arbitrary energy value = |, and draw a state 
e B randomly under the constraint {ip\H\ip) = E, 
which is Ti:{iI'^Ha) = E. What does Theorem[T]tell us about 
concentration of measure for this manifold of quantum states? 
To have all positive eigenvalues, our offset s must be s > — 1, 
and the shifted harmonic mean energy becomes 



1 



1 



1 



1 



3 + s 



The offset s (equivalently, the constant e) is not specified 
uniquely by Theorem[Tl we try to find a good choice by fixing 
e independently of ??. After some trial-and-error, e = 2 turns 
out to be a good choice (other values work as well, but not 
e = 1). The next task is to estimate the shift s which results 
from our choice of e = 2; it is determined by the equation 

where n = 3|i?|. It is difficult to solve this equation directly, 
but it is easy to see that a solution close to (—4 + V7)/3 ~ 
— .45 exists for large n. This fact helps to gain a rough esti- 
mate of s which is sufficient to prove strong concentration of 
measure: denote the difference of the left- and right-hand side 
by /„(s), then /„ (-i) > for all n > 8193. Since is 
decreasing, we get fn{x) > for all x G {—1 
s > 



i. The constant 



i], hence 



cis) 



3(1 + s) 
32 (I + s) 



is increasing in s, hence c > c(— = and similarly 
ELx/E' < I- On the other hand, since /„(0) < for all 
n € N, we have s < 0. Hence we have to consider the expres- 
sion E''^ / (e'^E'q) only in the relevant interval s S (—^,0), 
where it is decreasing and thus upper-bounded by |||. Con- 
sequently, a is positive and satisfies a < 30830. Substituting 
these expressions into Theorem[Tl we get the following result: 

Example 3. Drawing random pure state vectors lip) under the 
constraint (tplHlip) = |, where H is the observable defined 
in eq. Q, we get the concentration of measure result 

Prob{|/(V') - /I > At} < 30830 n5e-A"(*-^)'+^^ 
for every X-Lipschitz function f and all n > 8193. 

It is clear that the amount of measure concentration that we 
get from Theorem [T] depends sensitively on the spectrum of 
the Hamiltonian H. In particular, not all natural Hamiltoni- 
ans yield a non-trivial concentration result. For example, in 
Section|IIIl we show that for a sequence of m non-interacting 
spins. Theorem [T] does not give a useful concentration result 
in the sense that the corresponding concentration constant c 
in ([T]i will be very close to zero. However, we will also prove 
that this is not a failure of our method, but reflects the fact 
that there simply is no concentration in that case, at least no 
concentration which is exponential in the dimension. 

In the "thermodynamic limit" of large dimensions n, the 
condition on the energy E in Theorem [T] becomes Emi„ < 
E < Ea- From a statistical physics point of view, E,j^i„ is the 
ground state energy of "temperature zero", while Ea corre- 
sponds to the "infinite temperature" energy. Hence, the con- 
dition on E can be interpreted as a "finite temperature" condi- 
tion. However, this condition is no restriction: if one is inter- 
ested in concentration of measure for Ea < E < i^max, then 
the simple substitution H i—> —H and E i-> —E will make 
Theorem[T] applicable in this case as well. 

In the situation of Example [3] above, with Hamiltonian (O 
on the bipartite Hilbert space B with fixed |yl| and large 
we may ask what the reduced density matrix ip^ typically 
looks like. It is well-known |3;] that for quantum states without 
constraints, the reduced density matrix is typically close to 
the maximally mixed state. To estimate the typical reduced 
state in our case, we may consider the Lipschitz-continuous 
functions fi.j{ip) := ("0^) that is, the matrix elements of 
the reduced state. Theorem^ gives a way to estimate these 
matrix elements by integration over some ellipsoid. 

Instead of doing this calculation directly, we give a general 
theorem below which gives the typical reduced density matrix 
in the more general case that the global Hamiltonian H can be 
written 



H^Ha + Hb. 

i.e., if it describes two systems without interaction. 
Hamiltonian (O corresponds to the special case Hb = 
this case, we get: 



The 
0. In 
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Example 4. Random state vectors \ip) under the constraint 
{'il>\H\%li) = |, where H is the observable defined in typi- 
cally have a reduced density matrix close to 



Pc = 



1 

12 



5 + V7 
2(4-77) 
-1 + V7 



More in detail, we have for allt > Q and n > 8193 



Prob 



> 3V8 + ) !> < 369960nt 



This example is a special case of our third main theorem: 

Theorem 5 (Typical reduced density matrix). Let H = Ha + 

Hb be an observable in the Hilbert space 

Cl^l of dimension n ^ \A\ ■ \B\ with \A\,\B\ > 2. Denote 

the eigenvalues of Ha and Hb by {Ef\^.^^ and {E^]}^^, 
and the eigenvalues of H by Eki := E^ + E^ respectively. 
Suppose that the assumptions of Theorem Q] hold, and adopt 
the notation from there, in particular, E'f. := Ek + s with 
the energy offset s specified there. Then, the reduced density 
matrix := Tr^ |V')('0l of random pure state vectors \ip) € 
A® B under the constraint {ip\H\ij)) = E satisfies 



Prob 



> 



1,2 X 



^\A\{t + 5)^ < \A\i\A\ + l)ani 



for all t > 0, where the "canonical" matrix pc is given by 



Z^fe=l E' ^ 



n + 1 



l^k=l E' 



V 



E' 



and the constant 5 equals 




O In 2 



To illustrate the use of Theorem|5] we give a proof of Exam- 
ple ID We use the notation and intermediate results from the 
proof of Example[3] The energy shift s depends on the dimen- 
sion n, i.e., s = s{n), and we have lim„_s.oo s{n) = — ii^. 
More in detail, if we use Mathematica to compute the Taylor 
expansion of s{n) at n = oo, we find the inequality 



-4 + 77 _ 4(35 + 16V7) ^ ^^^^ ^ -A + Vl 
3 63y/n 3 



(4) 



for all n e N. The typical reduced density matrix that Theo- 
rem|5]supplies depends on n. It is 



3(71)) n 
3 



/ 



V 



l + s(n) 









2+s(n) 

1 



3+s(ri) 



which tends to the matrix pc from the statement of Example|4] 
as n 00. We can use eq. (IDi to bound the difference be- 
tween Pc"' and Pc- Using that is decreasing in s, while 
and are increasing, a standard calculation yields 



< 



(5) 



for all n > 829. Similar calculations can be used to bound 
the constant 5 from above: 5 < -11= for all n > 550. Thus, 



according to Theorem |5] it holds 



Prob 



r-p. 



(") 



> 



58 



< 369960n5 x 

M"(*-4k)^+4v^ 



for all n > 8193. The estimate Q together with 3\/8 • 



4= < 378- 



□ 



proves the claim in Example |4l 
An interesting aspect of Theorem |5] is that the typical re- 
duced density matrix does not maximize the entropy locally (if 
so, it would be the Gibbs state corresponding to Ha)- This is 
expected to have applications in quantum information theory 
in situations where random bipartite states with non-maximal 
entanglement are considered. It will be shown in Section ITVl 
below that the reduced density matrices maximize a different 
functional instead which is related to the determinant. 



where O 



ln[2an 2 
2n 



The "canonical" matrix pc given above does not immedi- 
ately have a useful physical interpretation. In particular, it 
is not in general proportional to cxp{—/3Ha), i.e., it is not 
necessarily the Gibbs state corresponding to Ha as one might 
have expected based on intuition from statistical mechanics 
(we discuss this point in more detail in Section HIH below). 
Note also that pc is not exactly normalized, but it is close 
to being normalized (i.e., Tr ~ 1 in large dimensions n, 
which follows from E' w E'^). 



m. IMPLICATIONS FOR STATISTICAL MECHANICS 

Recently, the concentration of measure phenomenon has at- 
tracted a considerable amount of attention in the context of 
quantum statistical mechanics. Consider some ensemble of 
quantum states, such as the set of all pure quantum states 
in a certain subspace of the global Hilbert space. The sub- 
space might be given, for example, by the span of all eigen- 
vectors corresponding to an energy in some small interval 
[E — AE, E + AE] with respect to a given Hamiltonian H. 

Suppose we are given a single, particular, random state vec- 
tor \^) from this subspace. What properties will this pure state 
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have? At first one might be tempted to think that there is very 
little knowledge available on the properties of the state, and 
that most properties should turn out to be random. However, 
the concentration of measure phenomenon shows that this is 
not the case - almost all the possible state vectors {t/j) will 
have many properties in common. 

Several authors llg UlsIl have recently argued that this prop- 
erty of measure concentration may help to better understand 
certain foundational issues of statistical mechanics. Concep- 
tually, statistical mechanics aims to predict outcomes of mea- 
surements on systems even in the case that we have only very 
limited knowledge about the system (say, we only know a few 
macroscopic variables). Ensemble averages are employed to 
make predictions, and the predictions agree very well with 
experiment even in single instances of the system. Concentra- 
tion of measure is then viewed as a possible theoretical expla- 
nation of some aspects of this phenomenon. 

As a paradigmatic example, Ref. ifioll considers the situa- 
tion of a bipartite Hilbert space Hs ®T-Le, consisting of sys- 
tem S and environment E. Then the setting is investigated 
where the set of physically accessible quantum states is a sub- 
space T-Lr C T-Ls ®T-Le (for example, a spectral windows 
subspace as explained above). If we are given an unknown 
random global quantum state vector jV') € "Hr, then what 
does the state typically look like for the system S alone? In 
this case, the postulate of equal apriori probabilities from sta- 
tistical physics suggests to use the maximally mixed state on 
T-Lr, that is l/dim('Hfl), as an ensemble description. Then, 
taking the partial trace over the environment will yield a state 
Q,s ~ Tr^ 1/ dim('H_R) which may be used by observers in 
S to predict measurement outcomes. 

According to Ref. ifioll . concentration of measure in the 
subspace T-Lr proves that almost all quantum state vectors 
IV') S "Hr have the property that the corresponding reduced 
state i/j^ := Tie IV') (V'I " very close to fig. That is, sa fie 
with overwhelming probability. It is then argued in Ref. llioll 
that this result explains why using the ensemble average fl^ 
is in good agreement with experiment even in the case of a 
single instance of the physical system. 

While Ref. ifToll considers a very general situation that does 
not allow for specifying directly what the "typical" state ils 
looks like, Ref. ^ make additional assumptions that allow to 
specify O5 in more detail. That is, if the Hamiltonian H is 

H = Hs + He 

(that is, there is no or negligible interaction between system 
and environment), if the restriction Ti. r is given by a spectral 
energy window, and if the bath's spectral density scales expo- 
nentially, then ils ^ cxp(— for some suitable /3 > 0. 
That is, under these standard assumptions from statistical me- 
chanics, Ref. argues that the typical reduced state is a 
Gibbs state. 

These results raise an immediate question: what happens 
if the restriction is not given by a subspace? In statistical 
mechanics, one often considers the situation that an observer 
"knows the total energy of the system", but does not know 
the exact microscopic state. In all the papers previously men- 
tioned, the intuitive notion of "knowing the energy" has been 



translated to the technical statement of "knowing with cer- 
tainty that the quantum state is supported in a spectral sub- 
space". Obviously, there is at least one natural alternative: 
knowing the energy might also be read as "knowing the en- 
ergy expectation value {il)\H\il)) of I'i/')"- 

In fact, this possibility has been proposed by several au- 
thors lfl9l - l2lll as a possible alternative definition of the "quan- 
tum microcanonical ensemble" (we call it "mean energy en- 
semble"). The results in this paper give some information on 
the applicability of the mean energy ensemble in statistical 
mechanics: 

1. As a positive result, we know from Theorem [T] that the 
concentration of measure phenomenon occurs for the 
mean energy ensemble as well, if that theorem is appli- 
cable to the particular Hamiltonian H and energy value 
E that is considered. In this case, many interesting re- 
sults from Refs. ll9l- [l3ll carry over to the mean energy 
ensemble. 

2. On the other hand, the ensemble does not seem to re- 
produce well-known properties of statistical mechan- 
ics, such as the occurrence of the Gibbs state (cf. Theo- 
rem|5]l. Hence it seems to describe rather exotic physi- 
cal situations. 

There is an intuitive reason why the mean energy ensemble 
behaves exotically: calculations involving Theorem |2] sug- 
gest that the fc-th energy level \Ek) typically has an occu- 
pation proportional to |('i/'|-Bfe)P ~ l/^I-' where E'^. is the 
corresponding shifted energy value (cf. Theorem [U; this is 
also visible in the form of the typical reduced density ma- 
trix pc in Theorem |5] In particular, typical state vectors jV') 
"spread out" a lot on the small energy levels. This produces 
a "Schrodinger cat state" which is in a coherent superposition 
of many different energy states. Such states are not normally 
observed in statistical physics. 

However, point 2. does not completely rule out the mean 
energy ensemble as a description of actual physics, due to the 
following fact: 

3. Our result is tailor-made for systems with the property 
that the corresponding mean energy ensemble concen- 
trates exponentially in the dimension n, similarly as in 
Levy's Lemma (corresponding to inequality (|6]l below 
with K{n) = const.). However, many natural many- 
body systems do not have such a concentration prop- 
erty, which is why Theorem [T]|2] and |5] do not apply in 
those cases. 

As an example, we will now show that the mean energy en- 
semble does not concentrate exponentially in the case of m 
non-interacting spins. Let the energy levels of each spin be 
and +L The total Hilbert space has dimension n = 2™. 
If k is an integer between and m, then the energy level k 
is (™)-fold degenerate. Moreover, suppose that we are inter- 
ested in the energy value E ~ am, where a e (0, i). To see 
if Theorem[T]is applicable, we determine a rough estimate of 
the energy shift s that has to be employed such that E' w E'^ . 
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We thus have to find s > such that 



k=Q 



ni\ 1 



k J k + s 



am + s; 



this is only possible if the (fc = 0)-term gives a significant 
contribution. The conclusion is that s must be very small, that 
is, of the order s w q;to2~™ (and this conclusion is confirmed 
by more elaborate large deviations arguments similar to those 
discussed below). But then, the constant c in the exponent 
in ([Til is approximately 



3£'min 

32E' 



3 

32 



am2 ™ 



32 



3 

32 



1 

n 



such that Theorem[T]does not give any measure concentration 
at all. 

It turns out, however, that this result is not a failure of our 
method, but reflects the fact that there simply is no concen- 
tration of measure which is exponential in the dimension n 
in this case. The precise statement below makes use of the 
binary entropy function 

H{l) = -7log2 7- (1 -7)log2(l -7) 
defined for 7 G [0, 1] (c.f. B ). 

Example 6 (Non-interacting spins: no exp. concentration). 

Suppose we have m non-interacting spins as explained above, 
and fix the energy value E = am with a G (0, i). Consider 
a hypothetical concentration of measure inequality 



Prob{|/-/| >0 < b-e 



(6) 



for all 1-Lipschitz functions f. Here, f denotes either the 
mean or the median of f, n = 2™ is the dimension of the 
system and b > a fixed constant. If such an inequality is to 
hold, then necessarily 



(7) 



for any /3 > a. In particular, the optimal exponent in ([7| is 
strictly smaller than one if a ^ ^ and goes to zero oi a — > 0. 

Proof. Let 7 be such that a < 7 < i. Let 



{i|Bi<7m} {i\Ei>jm} 



Normalization gives L + R = 1 from which we get 

am = E = ^]E[\ip^\'^] E, > 7m i? = 7m(l - L) 



^ L > 1 - 



a 



(8) 



reflections \tfj) 1— )■ — both expectation value and median 
of this function equal zero. If hypothesis (|6]l is true, then it 
follows by squaring that 



Prob{(3fiV'i)^ > u} < 6 - e 



-K(n) u 



and thus 



iia„Prob{(3fJV',)^ < u}du 



uduProhKditl^if > u} du 



ti=0 



Prob{(3?V'0^ > u}du 



(9) 



having used integration by parts. An analogous inequality ob- 
viously holds for the imaginary part ]E[(9?/'i)^J. From basic 
information theory (e.g.. Chapter 11 in Ref. 112511 ), we bor- 
row the fact that the number of terms in the definition of L is 
upper-bounded by 



\{i\Ei< 7m} I < 2'"-f^(T)+i°g2™. 
Combining Q and ( fTOl i: 

2^ r>m(g(7)+^loK,m) > ^ > _ ^ 

K(n) ~ ~ 7 



(10) 
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K{n) < —n 



ff(7)+^ logs'" ^ o(n^(^)) 



for every /3 > 7. □ 
It is highly plausible that similar results hold for the mean 
energy ensemble of many other many-body systems: the best 
possible rate of concentration (such as the right-hand side 
in (|6]l) is not cxp(— cn<^), but at most cxp(— cri^'t^) with 
energy-dependent exponent p < 1. Deciding whether this up- 
per bound can be achieved remains an interesting open prob- 
lem. Indeed, we end the present section by sketching a possi- 
ble route for tackling this question. 

The proof of Example|6]uses the coordinate functions t/j i-> 
2S example for continuous functions without strong 
concentration properties. We conjecture that this is already 
the worst case, i.e. that no function with Lipschitz-constant 
equal to one "concentrates less" than the "most-spread out" of 
the coordinate functions. 

A strongly simplified version of this conjecture is easily 
made precise. We restrict attention to real spaces and linear 
functions 



1/) i-> ^ Qi/'i 



We can also bound L using inequality To this end, con- 
sider the 1-Lipschitz function \ip) 1— > diijji, that is, the real part 
of the i-th component of \tp) in the Hamiltonian's eigenbasis. 
Due to the invariance of the energy manifold with respect to 



of the state vectors. Here, the are the coefficients of 
in the eigenbasis of the Hamiltonian and the c^'s are arbitrary 
coefficients subject to the normalization constraint J^i = 
1. The latter constraint ensures that the Lipschitz constant of 
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the linear funtion is one. If the vectors ip are drawn from an 
energy ensemble Me, we have the elementary estimate 



i i,j i 

< max E [i/jf ] = max Var [i/^i ] , 



having made use of the fact that Me is invariant under the 
transformation which takes ipi 1-^ — -0^ for some i while leav- 
ing the other coordinates fixed. 

Upper bounds on the variance of a random variable are suf- 
ficient to establish simple concentration estimates (by means 
of Chebychev's inequality). From that point of view, we have 
proven that coordinate functions show the least concentration 
among all linear functions, according to methods based on 
second moments alone. Generalizing this observation to more 
general functions and higher moments would allow us to re- 
strict attention to the eigenbasis of the Hamiltonian, thus "tak- 
ing the non-commutativity out of the problem". Conceivably, 
this would constitute a relatively tractable path to a more com- 
plete understanding of concentration in typical many-body 
systems. 



IV. INVITATION: A SIMPLE HAMILTONIAN 

As a preparation for the proof in the next section, we give 
a particularly simple example of a Hamiltonian which admits 
a more direct proof of concentration of measure; in the mean- 
time, we will also see that the "harmonic mean energy" from 
Theorem [T] appears naturally. Consider a bipartite quantum 
system on a Hilbert space H = Ha <E)'Hb, with dimensions 
1^1 dimH^ and := dim We may assume without 
restriction that \A\ < \B\. We are interested in the manifold 
of state vectors \tjj) £ H with average energy {ip\H\tl.') = E, 
where H = W is some Hamiltonian on H. What happens if 
we draw a state \ip) from that submanifold at random? Instead 
of studying this question in full generality (which we will do 
in Section[V]l, we start with the simple example 

H = Ha®1, 

where Ha = h\ is an observable on Ha alone. This is 
a special case of a more general bipartite Hamiltonian H = 
Ha + Hb — Ha Cgi 1 + 1 (8) Hb without interaction as studied 
in Theorem|5] A nice consequence is that 

(0|i?|0) =E^ Tt{^P^Ha) = E, 

that is, the constraint depends on the reduced density matrix 
■0^ := Tib alone. 

The unitarily invariant measure u on the pure quantum 
states on the global Hilbert space H induces in a natural way 
a measure va on the density matrices in Ha- if -S* is some 
measurable subset of the density matrices, then 

i^AiS) :=iyim I Ti-B 10X^1 e 5}) = I. (Tr^^^)) . 



Formally, the measure i^a is the pushforward measure 12611 of 
the unitarily invariant measure v on the pure states of Ha ® 
Hb with respect to the map Tr^; that is, va = (Ti's)^ {v). 



Due to the simple form of the Hamiltonian H, we may 
calculate probabilities with respect to va- The probability 
density distribution corresponding to va is invariant with re- 
spect to unitaries on Ha, but it depends on the eigenvalues 
t = {ti, . . -j^Al) of the reduced density matrix ip^. The re- 
lation is iBlHl: 

/ \A\ \ \A\ 

\ / »=1 i<j<\A\ 

(11) 

where z is the normalization constant and dt — dti . . . dt\A\- 
We will now study the simplest case \A\ = 2, where 



Ha = 



El 
E2 



and we may assume without loss of generality that Ei > i?2- 
We fix some arbitrary energy value E between Ei and E2. 
Since the average energy Ea '■= \{Ei + E2) is the energy 
of an "infinite temperature" Gibbs state, we additionally as- 
sume that E < Ea, which is no restriction, but saves us some 
distinction of cases. Thus, E2 < E < Ea < Ei. 

The state space is now the Bloch ball; that is, the unit ball 
in M.^. The distribution d^A has been calculated for the case 
\A\ =2 already by Hall 12811 . In ordinary spherical coordi- 
nates, it can be written 

dvAir) = p\B\{r)r'^dr dipdO, 

where p\B\i'>') = c_b(1 — r^)'^'^^, and cb is some normal- 
ization constant. It can be derived from eq. ( fTTT ) by using 
that points r in the Bloch ball correspond to density matri- 
ces with eigenvalues ti = 5(1 + ||f ||) and t2 = 5(1 — \\r\\). 
If \B\ = 2, this measure becomes the usual Euclidean mea- 
sure in the Bloch ball, corresponding to the Hilbert-Schmidt 
measure ll27ll . In general. 



P\B\{r)r^drdLpde^ / // piB\{\x\)dx 



MS) 



We write pb{x) := for.T e M.^. 

It is easy to see (and we will show this below) that the subset 
of mixed states in the Bloch ball with Tr(V''^i7^) = E' is 
the intersection of a plane with the Bloch ball; that is, a disc 
Ke- To determine probabilities, we need to compute the area 
p{X) of two-dimensional subsets X C Ke- In light of the 
density pB introduced above, it is tempting to use the term 
II X Pb{x) dx as the measure of X. The normalized version 
is a probability measure: 



Prob(X) 



JJ^PBix)dx 
II PB{x)dx' 



(12) 



It turns out that this measure agrees with the normalized ge- 
ometric volume measure Prob that we use elsewhere in this 
paper (for example in Theorem[T]). We discuss this fact in de- 
tail below after the proof of Proposition]?] Using this identity 
and deferring its justification to below, we get a concentration 
of measure result: 
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Proposition 7. The reduced density matrix ip"^ 
concentrates exponentially on the canonical state 



Pc 
that is, 

Problll^/.-^-p,!!! >e} <exp 
for all e > 0. 




A{Ei - E){E-E2) 



Proof. It follows from the condition "Iv{iI^^Ha) = E that 
must have diagonal elements 



E~E2 
E-i — E2 



do := 



El- E 



where di < d2- By Schur's Theorem 112911 . the eigenvalues 
ti and <2 of il)^ must majorize the diagonal elements, that is, 
((i2,di) -< {t2,ti), hence t2 ~ ||V''^l|oo > ^2- Thus, among 
all states t/;^ satisfying the energy condition, the canonical 
state Pc is the "most mixed" one, in the sense that its radius in 
the Bloch ball is the smallest possible. 

Geometrically, this has the following interpretation. Trans- 
lating the condition Tt{-4'^Ha) — E to the Bloch ball repre- 
sentation, that is, representing density matrices by vectors 



r e 



. we get 




2{E-Ea). 



This defines a plane with 



El — E2 

in M.'^; the intersection of that plane with the Bloch ball gives 
the set of density matrices that fulfill the energy condition. 
This set is a disc Ke, with the vector representation of pc 
at its center. Since the trace distance || • |ji on the density 
matrices corresponds to the Euclidean distance in the Bloch 
ball, the set of states tp^ satisfying the energy condition with 



> e corresponds to an annulus in Ke with inner 



radius e; we denote this annulus by Ke{s). Hence 



Prob 



Pc\\i > e} 



ILeK^(e) PB{x)dx 

ILeK^io) PB{x)dx' 



By elementary integration, we get 



PB{x)dx 



27r r\P--ri I SCOSf 

dtp j ds ■ spB I s sin tp 



\B\- 



2nIs|-i 



Since Ke = Ke{0), this yields 
Prob{||7A^-p,||i >£} = 



1 2 2\ 



1 - r2 

■2 \\B\-1 



l-ri 



Substituting 



< cxp 



A(Ei - E){E - E2) 



l-ri 



{El 



proves the claim. □ 
In this paper, the measure that we are interested in is the 
Hausdorff volume measure on the energy manifold Me- 
To compute the correct probabilities when restricting to the 
Bloch ball by partial trace, we need to invoke the pushfor- 
ward of pm^ with respect to Tr^. So is the probability mea- 
sure Prob that we have defined above in eq. (fT2l i equal to this 
pushforward measure, i.e., does 



Prob = (Trs), [p 



Me 



hold? Fortunately the answer is "yes", but this is not directly 
obvious. First we observe that Prob may be interpreted as an 
"energy shell measure" in the following sense. The submani- 
fold measure from eq. ( fT2b can be given by a limit, in a spirit 
similar to the definition of the "Minkowski content" (cf. Ref. 
IH): For X C Ke, denote by {X) the set of matrices ■0^ 
that are i5-close to X, and have an energy expectation value of 
Tr(t/'^iJA) < E. This is half of the (5-neighborhood of X. 
Then, up to a normalization constant, 



Prob(X) 



PB{x)dx 



X 

lim - 

s^o 5 

1 



PB{x)dx 



= jun-^'^iTi-B'iUiiX))). 

But {Ke) is a "slice" of the Bloch ball in between two par- 
allel planes. The set consists of those matrices determined 
by the inequality E — e < Tr(-(/;^if^) < E, where e > is 
some energy difference corresponding to 5. In particular. 



{ui{KE))^m I (mm e {E-e,E)}. 



This is an energy shell. Hence it is basically the uniform distri- 
bution on this energy shell on the pure states which, by taking 
the partial trace and the limit e ^ 0, generates our probability 
measure Prob. 

In the general case of arbitrarily many different energy lev- 
els {£'A;}fc=i> this energy shell has different "widths" at dif- 
ferent points X e Me', in the limit S ^ 0, this width is pro- 
portional to \\Ps'^E{x)\\^^, where P5 denotes the projection 
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onto the sphere's tangent space at x. Hence the corresponding 
"energy shell measure" does not in general equal the geomet- 
ric (Hausdorff) measure Prob used elsewhere in this paper, 
which arises from an analogous limit procedure, but starting 
with the uniform distribution in an e-neighborhood of Me- 

However, here we are in a very special situation: we only 
have two different energy levels Ei and E2 which are highly 
degenerate. In this case, it turns out that ||PgV£'(a;)|| is con- 
stant along Me- To simplify the argument, we double the 
dimensions and work in real space R"; the case n = 2\A\ \B\ 
applies to Proposition|7]above. 

Lemma 8. Consider the real energy manifold 



M 



E ■- 



^Ekxl^E, Y.'- 



k=l 



= 1 



fc=l 



in the special case that there are only two different energy 
levels, i.e. there exist £1,82 such that Ei ~ E2 = ... ~ 
Em — £1, ond Ern+i = - . - = En = £2- Furthermore, 
assume that £i < E < £2- Let Ps denote the projection 
onto the tangent space of the unit sphere, and let E{x) \— 
'^k=i^kx'^- Then, ||P5V£'(a;)|| is constant on Me- 

Proof Direct calculation yields 

\\PsVE{x)f = AE'^{x) - 4^;(a;)2, 
where E'^{x) := Elxl- Since 

E{x)^(Y,xI\£,^(i-Y^xI\ £2 



\k=l 



k=l 



equals E on all of Me, it follows that X^feLi ^"k constant on 
Me- But then. 



E\x) 




k=l 



is also constant on all of Me- □ 
Hence, in our case of only two different energy levels, the 
energy shell has constant width everywhere, such that the 
measure defined in eq. ( fT2] i indeed agrees with the geomet- 
ric measure that we use elsewhere in the paper 

Proposition [7] shows that a typical reduced density matrix 
is close to pc\ that is, it is diagonal in i/^-basis, and it can 
be written pc = e~^"* /Z with Z := Tr(e~'^'^'*) and some 
appropriate "inverse temperature" /3 > 0. Hence it is a Gibbs 
state. 

This result suggests that the local state always concentrates 
on a Gibbs state when H = Ha (8) 1, also in the more general 
case = dim "Ha > 3. But this guess is false, as we have 
akeady shown in Theorem|5]and Example|4]- the local density 
matrix always commutes with Ha, but it is not in general the 
corresponding Gibbs state. 

In light of the calculation above, it is now easy to give an 
intuitive explanation for this fact. In analogy to the previous 
proposition for |j4| = 2, we expect that the distribution of 



eigenvalues as given in eq. ( fTTT ) will dominate the concentra- 
tion of measure to some local "canonical" state. It also seems 
reasonable to assume (and can be verified numerically) that 
the {ti — tj)^ -terms do not contribute much for large \B\, and 
that the other terms exponential in dominate. But these 
terms are 



1^1 

i=l 



\B\-\A\ 



(detV^) 



A\\B\-\A\ 



and so we conclude that the reduced density matrix should 
concentrate on the one which maximized the previous ex- 
pression. This suggests the following conjecture; 

Conjecture 9. Suppose that H = Ha ® 1 is a Hamiltonian 
on a bipartite Hilbert space % = %a ®'Hb with fixed |^| := 
dim Ha and varying \B\ := dimHs. Then typical quantum 
states \il>) g % with fixed }nean energy (iplHltp) = E have the 
property that := Tr^ concentrates exponentially 

on the determinant maximizer 

Pc := argmaxjdct-!/;^ | Tr(-0^HA) = E} , 

given the maximizer is unique- 

We will now see that the prediction of this conjecture is 
consistent with Theorem|5] To this end, we now compute the 
determinant maximizer pc explicitly. First of all, it is easy to 
see that pc must be diagonal if written in the basis of Ha, 
i.e., [pc.Ha] = 0: Suppose cr is any density matrix with 
Tv{aHA) = E, and let {c^.^l^^]^ denote the diagonal el- 
ements. Then Tr(o'iJA) = <Jk,kEk = E, where Ek de- 
notes the eigenvalues of Ha- Now if a is the matrix with diag- 
onal elements ak^k and other entries zero, then TT:{dHA] = E 
is still true, but by the Hadamard determinant theorem ll29ll 

det a < Y\ <^k,k = det a. 

k=l 

Hence the maximizer is diagonal; it remains to determine 



its diagonal elements (Ai 



\A\) 



We have to maxi- 



subject to the constraints J2k ^fe = 1 



J2k ^kEk ~ E- It turns out to be difficult to do that directly, 
so we drop the normalization condition \k = \ for the 
moment and solve the resulting equation 



d_ 



d 



where A is the Lagrange multiplier This gives A = ^, and 
A - J- 

* ^ \A\ E, 



Since this distribution is not automatically 
normalized, we can use the freedom to shift the energy levels 
by some offset s e M, i.e., E'^ := Ek + s, E' -.^ E + s- 
The resulting distribution AJ := • -^7 is normalized, i.e., 
A'; = 1, if and only if 



E', 



Hi 
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that is, if the offset is shifted such that the new energy value 
E' equals the harmonic mean energy E'^. We have thus repro- 
duced the canonical density matrix pc of Theorem |5] More- 
over, the calculation above shows that the occurrence of the 
harmonic mean energy is very natural and not a technical arti- 
fact of our proof. (In Section lVTl we give a method for numer- 
ical sampling of the energy manifold, and there, the harmonic 
mean energy will appear in a natural way as well.) 



V. PROOF OF MAIN THEOREM 

Before proving the main theorems, we fix some notation 
that will be useful for the proof. Following the lines of Gro- 
mov 1I26I1 . we define a metric measure space X to be a separa- 
ble complete metric space with a finite Borel measure p, i.e., 
X = {X, dist, p). (In fact, one could more generally consider 
Polish spaces with a cr-finite Borel measure, as described by 
Gromov.) In this paper, we will only consider the two cases 
that X is the energy manifold Me, or a full ellipsoid N, both 
equipped with the obvious geometric measure and the met- 
ric dist that is induced by the surrounding Euclidean space 
~ C" (for more details, see Section|Vj. 

We denote the (n — 1) -sphere by 5"^^, and the ri-ball is 
denoted B", i.e.. 



abusing notation, we can write 



S"-^ := {xe 



<1}. 



The symbol d is used for the topological boundary, for exam- 
ple, dB" = S"-\ 

We denote the /c-dimensional (Hausdorff) volume measure 
on fc-dimensional submanifolds M C M" by pk- In case that 
Pk{M) < 00, we write pM for the normalized measure on 
M, i.e.. 



f^AiiX) := 



MX) 



for Borel subsets X C A/. Sometimes we consider subsets 
that are not actually submanifolds, but are turned into sub- 
manifolds in an obvious way. For example, the Ball B" is 
itself a metric measure space, but not a submanifold, since it 
is not open. However, its interior is a submanifold of R", and 



TT 2 



r 1 



while pB^ {B") = 1. Expectation values with respect to pM 
are denoted Ej\/ . 

Given some Hamiltonian H = W on C™, we would like 
to prove concentration of measure for the "mean energy en- 
semble" 

Me H' e C™ I (MHW = E, || Vl| = 1} . 

To relate our discussion to real-valued geometry, we work in- 
stead in K" with n ~ 2m. That is, doubling all energy eigen- 
values of H to get the energy levels Ei, . . . , En, and slightly 



Me 



X e 



k=l 



E, 



k=l 



Geometrically, Me is the intersection of the unit sphere with 
an ellipsoid (given by the energy condition). The action of 
shifting all energies by some offset (as postulated in Theo- 
rem[Tl) alters the ellipsoid, but not the ellipsoid's intersection 
with the sphere; it leaves Me invariant. It is interesting to 
note that the corresponding full ellipsoid's volume turns out 
to be minimal exactly if the energy shift is tuned such that 
E' = E'ff, i.e., for the harmonic mean energy shift close to 
the one which is postulated in Theorem[T] 

We would like to prove measure concentration for the alge- 
braic variety Me in ordinary Euclidean space. Introducing a 
function / : M" -> 



via 



\k=l 



we can write 



Mp 



Y.^lEk 

k=l / 



If E is any energy value with niin^ E^ < E < max^ E^ and 
E 7^ Ek for all k, then the differential df has full rank on 
all of AIe, such that Me is a proper submanifold of M" of 
codimension 2. 

If E = Ek for some k (but E 7^ minj; Ek and E ^ 
maxfc Ek), the eigenvectors corresponding to this energy 
value are singular points of /. Still, removing those eigen- 
vectors from AIe, we get a valid submanifold Me of M" of 
codimension 2. Since AIe and AIe agree up to a set of mea- 
sure zero, we will drop the tilde in the following, and simply 
write Me for the manifold with eigenvectors removed. If we 
treat Me as a metric measure space, we include the eigenvec- 
tors in its definition to have a complete metric space. 

As a submanifold of R", the sets Me carry a natural ge- 
ometric volume measure. Since every Me is a compact sub- 
manifold, it makes sense to talk about the normalized measure 
Ma/e on AIe, and to ask whether this measure exhibits a con- 
centration of measure phenomenon. Our main proof strategy 
to answer this question in the positive is due to Gromov luol . 
To explain this strategy, we introduce the notion of a "typical 
submanifold". Suppose that iV is a metric measure space, and 
AI C is a subset (say, a submanifold) which is itself a met- 
ric measure space. Then we say that "il/ is typical in A^" if 
AI has small codimension, and if a small neighborhood of M 
covers almost all of N. That is, pi^[{U^{M)) « 1 akeady for 
small e. 

For example, given an n-dimensional sphere N = S", any 
equator M (which is itself an (n — 1) -dimensional sphere) is 
typical in N; this is just Levy's Lemma. On the other hand, a 
polar cap with angle 9 is only typical in S*" if 6' w ^. 

Gromov's idea can now be explained as follows: 
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If A'^ is a metric measure space that shows concentration 
of measure, and if M is a typical submanifold of N, 
then M shows measure concentration as well: it "inher- 
its" concentration of measure from N . 

The intuitive reason why this idea works is as follows. Con- 
sider the behavior of Lipschitz-continuous functions on M C 
N . By continuity, those functions do not change much if we 
turn to an e-neighborhood of AI; but then, since M is assumed 
to be typical in N, we already obtain almost all of N, and so 
the behavior of the functions on M will be similar to that on 

- in particular, expectation values will be similar, and the 
concentration of measure phenomenon will occur in M if it 
occurs in N . However, Gromov seems to explore this idea in 
his book only for the case that iV is a sphere. 

In our case, we have to find a submanifold N C K" which 
is itself subject to the concentration of measure phenomenon, 
such that AIe C N holds and such that Me is typical in N . A 
first obvious guess is to use the sphere itself; clearly, A/^ is a 
subset of the sphere 5"^^ C M", and we have concentration 
of measure on the sphere by Levy's Lemma. However, Me 
can only be typical in S'""^ if the energy E is close to the 
"typical" value Egn-i ^t-^k, which turns out to be the 

mean value -^{Ei + E2 + ■ ■ ■ + En). Since we definitely want 
to consider different energies far away form the mean energy, 
N = 5"^^ is not a useful choice. Instead, it will turn out 
that we can choose N to be a full ellipsoid with appropriate 
equatorial radii (in fact, N will be the slightly enlarged full 
energy ellipsoid for an appropriate energy shift). 

Our main tool from integral geometry is the Crofton for- 
mula llsollsill . It expresses the volume of a submanifold of M" 
in terms of the average volume of intersection of that manifold 
with random hyperplanes. 



Lemma 10 (The Crofton formula |l30tl)- Let M be a q- 

dimensional submanifold o/R". Consider the invariant mea- 
sure dLr on the planes of dimension r in M". Ifr + q > n. 



Pr+q-n{M H L,.) dL,. = cr(g, r, n)^iq{M), 



where a{q,r,n) 



q—n 



n 



t—n—r 



.0, 



and Or, := 



2ir 2 



denotes the surface area of the n-sphere. 



Note that the Crofton formula is formulated in Ref. JSC 
(14.69)] only for the case that M is a compact submanifold, 
but the proof remains valid also in the case that M is not com- 
pact. This observation is also expressed in Ref. Isill . 

For the details of the definition of the invariant measure 
dLjj, see Ref. Isoll . In short, the Lie group 9Jl of all mo- 
tions (translations and rotations) in R" possesses the closed 
subgroup f^r of motions leaving a fixed r-dimensional plane 
L° invariant. Then, there is a one-to-one correspondence be- 
tween the set of r-planes in M" and the homogeneous space 
^Sl/^r- Since both 9Jl and S)r are unimodular, dJl/Sjr pos- 
sesses an invariant density dLr which can then be interpreted 
as a density on the 7--planes in M". 



Note that this measure is defined only up to some multi- 
plicative constant - different authors use different normaliza- 
tions (cf. Ref. ll32ll ). which gives different constants a{q, r, n) 
instead of (T{q, r, n) in Lemma [TOl However, for fixed I and 
n, we always have 

a{k,l,n) d'{k,l,n) 
<j{k' ,l,n) a{k',l,n)' 

and it is only those ratios that are relevant for the calculations. 
If r + g = n, then pr+q~n{X) = P-oiX) equals the number 
of points in the set X. A useful possible expression for the 
constants is ll32ll 



r(^)r(r±i) 



a-((j,r, n) 



r(H±i)r( '^+''~"+^ ; 



The Crofton formula will be useful in the following lemma, 
which gives a lower bound on the measure of e-neighborhoods 
of subsets of Me- Given any subset X C K." (say, any curve), 
directly estimating /i„([4(X)) seems difficult - if the curve 
intersects itself many times, the neighborhood can be almost 
arbitrarily small. On the other hand, the Crofton formula takes 
into account how "meandering" subsets X are, by counting 
the number of intersections with hyperplanes. 

Lemma 11 (Measure of neighborhood). For every open sub- 
set X C Me C M" with n > 3 and e > 0, we have 



lin{U,{X)) > 



Tie 



-nn-2{X), 



4(n- 1)' 

where U^{X) denotes the open e-neighborhood in R" of X. 

Proof. If L2 is any 2-dimensional plane in R", then AIe H 
L2 = (dN n L2) n (5"" n L2), where dN is the surface of 
an ellipsoid. If the intersection is not empty, then S"" n L2 is 
a circle, and dN n L2 is a compact quadratic curve, hence an 
ellipse. By the Crofton formula. 



Po{Me n L2)dL2 = a{n - 2,2,n)pn~2{ME) < 00, 



so the set of planes L2 with pq{Me L2) = ij^{AlE H L2) = 
00 has measure zero, and we can ignore them. Let now Lx be 
the set of all planes L2 such that X n L2 is a finite non-empty 
set. Since a circle and an ellipse in the plane intersect in at 
most four points if they are not equal, we have ^{Xr\L2) < 4 
for all L2 G Lx ■ Hence, the Crofton formula yields 

a{n-2,2,n)p,,^2{X) = f fio{X L2) dL2 

= f #{XnL2)dL2 

JLx 

< 4: I dL2. 
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Using Crofton's formula for the n-dimensional submanifold 
U^{X), we get on the other hand 

a(n,2,nK([/e(X)) = / /i2([/s(X) n ia) ^^2 

> [ fi2iU,iX)nL2)dL2 

JLx 

> ne^ / dL2, 

Jlx 

since every plane that intersects X intersects Ue{X) in at least 
a disc of radius e. Combining both inequalities, the claim 
follows. □ 
To deal with concentration of measure on submanifolds, 
we need to introduce some additional notions of the theory 
of measure concentration; they all can be found in the book 
by Gromov ll26ll . and also, e.g., in Ref. ('33] with a few errors 
corrected. 

Let {Y, v) be a metric measure space with < to := 
v{Y) < oo, then the "partial diameter" diam is defined by 

diam(j/, TO— k) := inf{diam(yo) I C Y,v{Yo) > m — k}, 

that is, the smallest diameter of any Borel subset with mea- 
sure larger than m — k. If {X, fj) is a metric measure space 
with TO := fi{X) < oo, the observable diameter ObsDiam is 
defined as 

ObsDiam(X, k) :~ sup |diam(/i o /~^, to — k) | 
/ : X ^ Mis 1-Lipschitz} , 

where /i o /^^ is the push-forward measure on f{X) C M. In 
the special case that /i = ^x, i-C-, if M is normalized such that 
m = 1, this definition implies 

ObsDiam(X, k) < D ^ Vf : X 1-Lipschitz 
Ve > 03Yo C f{X) c R with ^ix (/"^(i^o)) > 1 - k 

and diam(yo) < D + e. (13) 

This shows that small ObsDiam amounts to a large amount of 
measure concentration - in fact, one can conversely infer that 
ObsDiam(X, K)<L'forD>0,0<K<i impHes that all 
A-Lipschitz maps f : X ^ M. satisfy 

^ix{xeX : \fix)-mxf\>XD}<K, (14) 

where mxf is the median of / on X, see Ref. ll26ll and, for a 
proof, Ref. ll34l Lemma 2.3]. (Replacing k by 2k on the right- 
hand side removes the restriction k < Another useful 
notion is the separation distance 

Scp{X; Ko, ■ • ■ , kn) := sup{S \ 3Xi C X : ^i{Xi) > Ki, 
A\si{X^,X■j) > 6, Xi open}. 

In Ref. ll26ll . arbitrary Borel sets are allowed in the definition 
of the separation distance; here, we only use open sets, since 
they are submanifolds and hence subject to the Crofton for- 
mula. In the most important case of two parameters, the equa- 
tion Scp(X, K, k) ~ D implies that for every e > there are 



open subsets Xi,X2 C X with i^{Xi) > n and ^{X2) > n 
such that dist(Xi, X2) > D - e. 

In the following, we need an inequaUty relating separation 
distance and observable diameter. It has first been stated in 
Ref. ll26ll . and a small lapse has been corrected in Ref. ll33ll . 
Since the notation of Ref. ll33ll differs significantly from the 
notation used here, we give a proof in order to keep the pre- 
sentation self-contained. 

Lemma 12 (Observable diameter). For every metric measure 
space X, and for every k > k! > Q, it holds 

ObsDiam(X, 2k) < Sep(X;K,K) < ObsDiam(X, k ). 

Proof. Suppose that ObsDiam(X; n') < D. Let Xi d Xhe 
an open set with px {Xi ) > k, and define a function f : X ^ 
R via 

f{x) := dist(a;,Xi) = inf dist(x,?/), 

where d denotes the metric on X. It follows from the triangle 
inequality that / is continuous with Lipschitz constant 1. Let 
£ > 0. According to ( fT3] l. there is a subset Iq C M with 

diam(yo) < D + e such that fix{x G X \ f{x) G Fo} > 
Since 1-k' + k> 1, it follows that Xin/-i(yo) 7^0, 
hence we have G Yo- Similarly, if X2 C X is another open 
set with px{X2) > k, it follows that X2 n /^H^o) 0' so 
there is some x G X2 with f{x) G Yq. Thus, 

dist(Xi,X2) = inf f{x) < swp y ^ diam(yb) < D + e. 

Since e > was arbitrary, it follows that dist(Xi, X2) < D, 
and thus Scp(X; k, k) < D. This proves the second inequal- 
ity. 

To prove the first inequality, suppose that 
ObsDiam(X, 2k) > D. This means that there exists a 
1-Lipschitz function f : X ^ R such that for all Yb C M 
with 

f^x{x G X I fix) eYo}>l~2K 

it holds diam(Fo) > D. Clearly, the function M{a) := 
px (/""'^ ((— 00, a])) is increasing in a G M. Let aq := 
inf{a G R | M(a) > k}, then 

l^x (/"^ ((-00, ao]) U /"^ {{ao,a])) > k 

for all a > oq. Since every finite Borel measure on a 
Polish space is regular Issl Ulam Theorem], it follows that 
/ix (/^^ ((— 00, Oo])) > K. Similarly, define N{b) := 
lf-^ {[b, 00))), and let 

60 := sup{6 G M I N{b) > k}. 

An analogous argument shows that px (/^^ ([60,00))) > 
K. Moreover, ^x (/~^ (('2, 6))) > 1 — 2k if a < oo 
and b > bo. Due to the regularity of px, we conclude 
that fix (/^^ (Yo)) > 1- 2k for Yq :== [ao,&o]- Setting 
Yi :— (— cx),ao] and Y2 := [60,00), we get dist(Yi,y2) > 
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diam(yo) > D. Letting Xi := f-'^{Yi) for i = 1,2, it fol- 
lows from the Lipschitz continuity of / that dist(Xi, X2) > 
D, and ^x{Xi) > k. Now if e > is arbitrary, the 
open sets Xi Ue{Xi) have measure fix{Xi) > k, and 
dist(Xi,X2) > D ~ 2e. This proves that Sep(X; k, k) > 
D. □ 
Now we can formulate a lemma on how Me inherits mea- 
sure concentration from surrounding bodies: 

Leimna 13 (Measure concentration on M e from that of sur- 
rounding body). Let £ > 0, and let N C M" be a metric 
measure space such that Ui;{Me) C N. Then, for all k > 



/i„(A^)4(n-l) 



ObsDiam(Mi3,2K) < 2e+0bsDiam N, 



Proof. Abbreviate X :~ Me- Suppose that Scp(X; k, k) = 
D, that is, for every S > 0, there are open subsets 
Xi,X2 C X such that fix{Xi) > k and ^x{X2) > k and 
dist(Xi,X2) > D ~ 5. Let := Ue{Xi) C A^. Using 
Lemma nn we get 



,.{X.) = ^ > 1 



- /in(iV)4(n-l) 



1 



> 



^„(iV) 4(n-l 
^in{N) 4(n-l)' 



Since also dist(Xi,X2) > I? — 5 — 2£, we get 
Sep(A^; Ke, Ke) > D — 2e. All in all, we have shown that 

Sep(X; K, k) < 2£ + Sgp(A^; k^, /tj. 

Lemma[T2]yields the chain of inequalities 

ObsDiam(X, 2k) < Sep{X;K,K) 

< Sep{N;K,,K,)+2e 

< ObsDiam(Af, k^) + 2e 

for every k'^ < k^. □ 
Our goal in the following will thus be to find a good 
n-dimensional body N with small observable diameter and 
Ue{Me) C N such that the ratio ^"^ ^^^^y^^-* appearing in the 
previous lemma is not too small. To this end, note that 

where Ne ^ {x eW^ \ J2k=i ^^xl < E} is the full en- 
ergy ellipsoid. We would like to relate /i„_2(9(S'"^^ n A''^)) 
to iJ,n-i{S"~^ ONe)- For this purpose, the following isoperi- 
metric inequality will be useful. 

Lemma 14 (An isoperimetric inequality). Let n > 3, and let 

B C 5"^^ C M" be a Borel subset which covers at most half 
of the sphere, i.e., iign-i[B) < i. Then, 

fln-2{dB) 1 _ 



Proof. We use the isoperimetric inequality on the sphere 124 
[36i Appendix 1]: Among all Borel sets in S""^^ with fixed vol- 
ume, the minimal volume of the boundary is assumed by a 
round ball. Thus, let C"~^ C 5""^ be a polar cap (i.e., 
round ball) with corresponding angle < i < ^ such that 
^„_i(C"^^) = ^n-i{B). By the isoperimetric inequaUty, 

fin-2{dB) ^ /.„_2(aCr') 



Pn-l{B) '- Pr^-liCr') 



"-2^ -sin^-^i 



Mn-2(5"-2).;J 



sin"-2 t 



fn{t). 



J COS t 



Fort e [0,7r/2),let/i„(t) (n-2) /„ ""^""^ ' 
then /ij^(t) = — sin"~2 i(H-tan2 i) < 0, so /i„ is decreasing. 
Since /i„(0) = 0, it follows that ft,„ (t) < OforalH G [0,7r/2). 
Multiplying the corresponding equation with the non-negative 
expression 



gives for < t < ^ 
nt /^... 2 



(n-2) 



cost 
sint 



sin"-2 t 



,n-2 , 



d6l 



< 0. 



But the left-hand side is exactly (In fn{t))', such that In /„(t) 
is decreasing, and hence /„ {t) is decreasing, so 



fn{t) > fn 



2r(f) 



and this expression is larger than if n > 3 (it grows 
asymptotically like yj^y/n). □ 

To deal with ellipsoids, we need some results on expecta- 
tion values of certain functions. 

Lemma 15 (Ellipsoidal expectation values). Let N C M" be 

the full ellipsoid with equatorial radii {ak}^^i, ak > 0. Then 
we have the following expectation values with respect to the 
geometric measure in N: 

„ u u2 al + al + ... + al 
t(V x = , 



En\\x\\' 



(n + 2)(n + 4) 



Proof. First, we use a linear transformation to reduce the ex- 
pectation value calculations to integrals on the ball B" = 
{x e R" : ||x|| < 1}. That is, let 3>(x) := diag(ai, . . . , a„)a; 
for x G M", then by the transformation formula for integrals, 
we have 



N 



f{x)dx= / fi<^ix))\dctD<i>{x)\dx, 



where det_D$(a;) — aia2 ■ . ■ a„. The only other non-trivial 
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ingredient are the spherical integrals 



x^dx 



/ Xidx = 



dx 



n + 2 ' 

3M»(g") 
(7i + 2)(n + 4)' 

(n + 2)(n + 4)' 



(15) 



These formulas can be proved directly by applying hyper- 
spherical coordinates, cf. Ref. lH^l- □ 
Now we are ready to estimate the crucial expression 

'^"ti'^iNf^ ^'^^ surrounding elhpsoids N . 

Leiiuna 16 (Ratio of Me and the surrounding ellipsoid). 

Let n > 3, and 



k=l , 



be the harmonic mean energy corresponding to the energy lev- 
els Ek > 0, fc = 1, . . . , n. Suppose that 



E = 



'-{1 + S)Eh 



with some 6 > such that the energy manifold Me is not 
empty. Moreover, suppose that E is less than the median of 
EkX^. on the sphere S"'^^. Let N be a full ellipsoid with 
equatorial radii {a/c}/J=i, o,k > 0, i.e., 



N 



X e 



k=l 



Then, we have 

Hn^2{ME) 



lin{N) 



> 



(1 + 2,5)^ 



1 - 



fc=i 



alEk 



2E' 



^ El 

— 1 k J 



k=l 



Proof. According to the isoperimetric inequality from 
Lemma[T4l we have (with Ne the energy ellipsoid as defined 
directly above that lemma). 



'hNe)) > ^fin-iis^-'nNE). 



Hence it remains to lower-bound /i„_i(5"^^ n Ne)', this is 
exactly the Haar measure probabihty Prob{(?/'|iJ|V') < E}. 
In principle, this probability can be computed exactly; us- 
ing the volume-preserving map from the unit sphere to the 
probability simplex |38l], this probability equals the ratio of 
the volumes of the two bodies that originate from intersect- 
ing the simplex with a hyperplane. This ratio has been com- 
puted in Ref. iM], and the result is J2k:Ek<E Yii^k e~-%\ 
for E < -Bmax- Unfortunately, the result is only valid in 



the non-degenerate case; moreover, despite its simple form, 
it is hard to estimate that value in a way which is useful 
in the current calculation. Thus, we instead use a different 
approach which is based on geometry of the ellipsoid. Let 
^i^) J2k=i ^kxj., and let S*"^^ be the sphere of radius r 
in M". First we show the inequality 

^Jin-l{Sl;^ n Ne) > A"-Vn-i(^r' n Ne) (16) 

for any A G (0,1). This can be seen as follows: let x E 
S^-'^ n Ne such that ||a;|| = r and E{x) < E. Then, 
||A.Tj| = A||a:|| and £'(Aa;) = X'^E{x) < E, such that 
X G Sl;^ n Ne- Hence \{SJ^-^ n Ne) C 5^7^ n Ne, and 
so Hn-iiS'l'^ n Ne) is lower-bounded by ^„_i(A(S';'~^ n 
Ne)), which equals A"" Vn-il-??"^ n Ne). 

Let B" ^ be the set of vectors in R" with norm between a 
and h. With the help of Inequality ( fT6] l, we get 



^lr,^^{Sl'-^r^NE)dT 



< 



Vl+2(5 



(1 + 2(5)^ - 1 



= Ai„_i(5"-^niVB) 

Thus, we have reduced the problem to finding a lower bound 
on fJ.n{NE n ^ i+2s )- Indeed, applying Lemma [TSl to the 
assumptions of this lemma, we see that the expectation value 



of ||.T 

actly 



with respect to the geometric measure in Ne is ex- 



EAr_gj|a;j 



1 



so we can indeed expect that much of the weight of Ne is 



contained in B''' 



To prove this, we use the Chebyshev 



l,Vl+2i5' 

inequality. Let cr^ be the variance of with respect to the 
geometric measure on Ne, then it is easy to see that 



Ne 



\\x\n'< 



2E' 



(7i + 2)(n + 4) 



n ^ 
k=l 

and the probability that a point in Ne is not contained in 

2 

^1 y/ i+'^s upper-bounded by Hence 



flniNEnBl^) 



> 



1 - 



2E^ 



XfIn{NE)- 



2)(n + A)f^^E; 



The claim follows from substituting explicit expressions for 
Hn{NE) and fin{N). □ 
Since we want to show that our energy submanifold Me 
inherits measure concentration from an ellipsoid, we first have 
to prove concentration of measure for ellipsoids; 

Lemma 17 (Measure concentration for ellipsoids). Let N C 

R" be the full ellipsoid with equatorial radii {ak}k^i, ife > 
0. Then, for every k > 0, we have 



ObsDiam(7V, k) < 



4a 



In- 
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where a := maxfe Ofc. 

Proof. In accordance with Refs. ll24ll40ll . for a (convex) body 
K C M" with surface S :— dK, we say that K is strictly 
convex if for every e > there exists some 5 ~ die) > Q such 
that x,y G S and — y|| > e implies {x + y)/2 e (1 — 5)K. 
The unit ball is strictly convex, and one can choose 



> 



6(8) = 1 - 1 - 



for < e < 2. According to Ref. IM p. 37], if A C is 
any measurable subset with e-neighborhood A^, we have 



1 



Now let L = diag(ai, . . . , a„) be the linear map which 
maps the ball i?" onto the ellipsoid N. Since L is linear, it 
preserves the geometric measure; that is, for every measurable 
subset A C i?", we have ^n{L{A)) = ^b"{A). Now let 
B C Nbe any measurable subset. We claim that 



L 



CB, 



To prove this, let 



(L-'B) 



and let x :— L~^y. It follows that there is some x' E L^^{B) 
such that ||a; - a-'|| < |. Let y' := Lx' £ B, then \\y - y'\\ = 
||L(2;-a;')|| < ||L|| • ||a:-.T'|| < £ since = a, so y e B^. 
Thus 



HN{Be) > m{L[{L-^B)^'\'^ =fiB'^(^{L-^B)^'^ 



> 1 - 



1 



^1BAL-'B) 
1 



Let now / : iV ^ R be any 1-Lipschitz function, let m/ be 
the median of / on N, and let A := {x E N \ f{x) < m/} 
such that fiN{A) = i. It follows that 

fiN{x e N I f{x) >mf +e} < ^inIx G N \ x A^} 

= l-flN{Ae) 

1 



< 



-«£V(4a^) 



Repeating the calculation for /(x) 
union bound yields 



nif — e and applying the 



(17) 



Due to the characterization of the observable diameter as 
given in ( fT3] l. we obtain ObsDiam(A^, k) < 2e, and the claim 
follows from expressing e in terms of k. □ 
As a last technical lemma, we need a comparison between 
the mean and the median on the {n — 1) -dimensional sphere. 
We suspect that the statement is well-known, but we have been 
unable to locate a proof in the literature. 



Lemma 18 (Mean vs. median on the sphere). Let f : 

S""^^ M.be any function that is X-Lipschitz with respect to 
the Euclidean distance measure, inherited from the surround- 
ing space M". Moreover, let E^n-i/ denote the expectation 
value, and mgn-if denote the median of f on 5"^^. Then, 



|E5„-i/-m5,.-i/| < 



Att 



Proof. Without loss of generality, we may assume that A = 1. 
We use Levy's Lemma on the sphere Il36ll . Normally, it is for- 
mulated for the geodesic distance p instead of the Euclidean 
distance d; but since d < p, it follows that / must be 1- 
Lipschitz also for the geodesic distance. Since two arbitrary 
points on the sphere always have geodesic distance less than 
or equal to tt, it is clear that \f{x) — mg-n-if \ < w for all 
X e 5*""^. Let e > 0. Levy's Lemma states that 



Ms"-i {\.f{x) 

Abbreviating m : 

|Es„-i/-m| 



ms^-if \ > e}< \l--e 



mgn-if and /i := ^511-1, we get 
= |Es„-i(/-m)|<Es„-i|/-m| 
£ae/i5,^-l{|/ - m| > £}(ie 

Jo 

/>oo 

= / Ms"-i{|./-™| > £}de 
Jq 



<./f 



00 



TT 



e-' ("-^)/^ d£ = , 

2\/^r^ 



This proves the claim. □ 
Now we are ready to prove concentration of measure for 
the manifold that arises from intersecting a sphere with an 
ellipsoid in Euclidean space. 

Theorem 19 (Concentration of measure for Me, M-version). 

Let n > 3, and {EkY^^i '^^y of positive energy levels 
with arithmetic mean Ea ■= ^ X]fc=i ^k, harmonic mean 

Eh ■= (^^ Y^k=i "gr) ' B„uu maxfe Ek, min- 

imum E,nin miufc Ek, and Eq^ := ^ "W' ^'^PPOse 

that E is any energy value which satisfies 



• E^ (I 



• E <Ea- 



Tr{E„ 



-Er,-. 



Eh for some e > 0, 



Then, for every X-Lipschitz function f : Me — >■ K with 
median f, we have for the normalized geometric measure 

M = PMe 



f^{\f-.f\>t} 



< 



E^ l- 



2E^ 



whenever the denominator on the right-hand side is positive. 
The constant /3 > can be chosen as (3 = "^^^^/e < 1075. 
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Moreover, the median f of f on Me can be estimated as 
follows. Let N be the full ellipsoid of points x € R" with 
^kxj. < E {l + i), and let Xn be the Lipschitz con- 
stant of f in N. Then 



|/-Ejv/| < A 



N 



_3_ 

An 



Em 



■O 



where the constant b > can be chosen as b 



1 



< 15, and 



Oln 



e 



2 

max 



1 2l3n2E, 
n ^ n ° ^2 {i _ 2E^ 



£2 £2 



Proof. Let S := We may suppose that not all energy lev- 
els are equal, i.e., there exist k and I such that Ek ^ £"/; 
otherwise, there is nothing to prove. Define the function 
E : M" ^ E by E{x) := J2l=i Ekxl, and E : M" ^ M 
by E{x) := E^i^ + Efc=i(-^fc " E„a^)xI, then E ts"-i = 
E \s.-i. Moreover, \\VE{x)\\^ < 4{E^^^ - E^^^f for all 
||x|| < 1, which proves that the Lipschitz constant of £^ fs"-! 
with respect to the Euclidean distance in K" is upper-bounded 
by 2(£'n]ax — i?min)- Sincc Ea = '&s-n--iE{x), the third con- 
dition together with Lemma [TS] ensures that E is less than or 
equal to the median of £'(2;) on the sphere. 

For every E' , define the energy ellipsoid Ne' via Ne' 
{x e R" I E{x) < £"}. Suppose that x G Ue{ME), so there 
is some y G Me with ||a- — t/j| < e, hence 



\E{x)-E{y)\ < max || V£;(z)|| • |jx - 

2GC/s(Me) 

< 2E^.,,{l + e)e, 



y\\ 



It follows that E{x) < E + 4i?max£ if < e < 1. Conse- 
quently, Ue{ME) C NE+ieE,„.^, and N := NE+ieE„^ will 

be the surrounding body of Me that we use when applying 
Lemma[T3] We arbitrarily fix the value 

E 



4:nEmiix 

which turns out to be an almost optimal choice (clearly, < 
e < 1). Hence N = N^^^_^j^-^ is a full ellipsoid with equato- 
rial radii 



E / 1 
— 1 + - 

Ek\ n 



Using that ((1 + 2(5)"/^ — 1) ^ > e ""^ and a few more easy 
simplifications, we get by applying Lemma [T6l 



l^^Ml ^ 1 ( ^ 

l^niN) 2 I 



2E^ \ / 1 



By LemmafTTland 1 + i < 1 + i, we have measure concen- 
tration in N: 



Then Lemma[T3]vields measure concentration in M e - apply- 
ing that lemma, using the previous inequalities together with 

the fact that E < E^sx and that (l + i) ^ is decreasing and 

hence lower-bounded by lim„^oo ((l + ^)") ^ = e^^, we 
get 



1 E 
ObsDiam(M£;,2K) < h 8W x 

2n V 



Then the first claimed inequality follows from the characteri- 
zation of the observable diameter as given in ( fT4] i. 

To prove the second claim, suppose that / : — > M is any 
[Xn ~ 1) -Lipschitz function, and define for ^ > 

X^:={x^Me : - tom^/| < A^} . 

We already know that ijlme ) is large. LemmafTTIvields 

^iN{Ue{X^)) = — > — — — — 

Pn[N) 4(n-l) ^J,n{N) 

4(n-l) Pn{N) 
We know that we have measure concentration in N; setting 



E / 1 
a := mm Ofc = I -- — 1 + - 



and using eq. ( fTTI ) of LemmafTTl we get for all C > 

PN{xeN : |/(x)-TOw./|>C}<4-e-"'^'/(4-'), 

■~t2 //'Qn-,2 \ 2\ 



and using 11361 Appendix V.4], 

PN{xeN : |/(x)-Ejv/|>C}<8.e-"^'/(32a2A2)^ 



Set now 



n P , 



then y^N{\f{x) - Eat/I < C} > 1 - P. Thus, 

Ue{X{)r\{x^N : - Eat/I < C} ^ 0, 

and if x is any element of that intersection, it holds \f{x) 
Eat/I < C and \ f{x) - tome/| <^ + £, such that 



' "^2 8 

\mMj -^Nf\<a[—\n-] + ^ + e. (18) 



Now we specialize ^ by setting 



'1 4 
ObsDiam(iV, k) < 8 ( In - 

n 3i?inin K 



1 af E /, a 

C := h8 (5+ - 

2n V^min V n 
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where 



:= - - In 1 - 



2E' 



, 4096n5£;2 
In ■ 



> 



The assertion of 



This ^ is chosen such that i^lm^{X^) 
the theorem is then proved by substituting all the previously 
established inequalities into ( fTSl ). □ 
Consider now the assumptions given in Theorem [T] but 

, double 



72 



denote the complex dimension by fi. Define i 
all energy eigenvalues, and substitute this into Theorem [T9| 
After dropping all tildes, this substitution yields the state- 
ments of Theorem [Hand 121 The proof of Theorem |5] can now 
be given as follows. 

Proof of Theorem^ Every ^ A® B can be written 

!'/'> = II V'jfc|j)®|fc>, 

where F^lj) = Ef\j) and HB\k) = E^\k). We embed all 
vectors in real space R^" by introducing coordinates Xjk and 
yjk such that 

First we apply Theorem |2] to estimate the matrix elements 
of -0"^ Trs Embedding the ellipsoid N from 

Theorem |2] into M^", we get a real ellipsoid with equatorial 
radii [E' (l + ^) / ^jk)^ ^ where each equatorial radius ap- 
pears twice, namely for the coordinate axes Xjk and yjk- Let 
v,w G {x, y}, then a transformation to spherical coordinates 
yields 

EN{VabWcd) = 71:7V / ^abWcddz 



E' (l 



J-1 

2n) 



VabWcd dz 



f if (a, b) ^ (c, d) oi v ^ w 

~ [ l^[?2t+2) otherwise, 

where we have used eq. (fTST i and the equation Jg„ xiX2 dx = 
0. Hence 



and for the imaginary part, we get 







< 



ifp = q. 



Consider the functions rpq{Tp) ^t{p\ilj^\q) and ipq{ip) := 
"^{plij^lq). All corresponding Lipschitz constants Xn in the 
ellipsoid N satisfy 



Aat < 2 



E' 
EL, 



1 



since this square root denotes the largest equatorial radius, 
which is an upper bound to It follows from Theorem |2] 
that the median of both functions satisfies 



'3^(Pc)p,g| 

^{Pc)p,q\ 



< 2 



E' 
EL. 



1 

1 + - 

n 



+ 15o„ 

8n 



= :S 



where 



E' 
EL~ 



ln(2ani 



2n 



By the triangle inequality, we have 



\rpq{^p) - ^{Pc)p,q\>t+2S \rpq{^P) - rpq\ > t 

\ipqilp)-'^{Pc)p,q\>t+2d |ip,(V') 



for every t > 0. Since the Lipschitz constants of rpq and ipq 
in the sphere (and thus in AIe) satisfy A < 2, it follows from 
Theorem[T]that 

Prob{|rp,(^) - ^ipc)p,q\ >2t + 2S}<r] 

and similarly for ipq, where we used the abbreviation 



r] := a ■ cxp — cn t 



1 

An 



2ey/^ 



Combining the two inequalities for the real and the imaginary 
part, we get 



ENlppklpqk = [{XpkXqk + ypkyqk) + iiVpkXqk - Xpkyqk)] Prob | | (p| V''^ I?) " (Pc)p,g | > %/2(2t + 25 ) | < 2r?. 



Sp,qE' (1 



2n 



E'pk(^ + ^) 

A \B\ ~ 

Since = J2k=i "^pk^qk^ this yields 



By the union bound, the probability that there exist indices p 
and q such that | {p\'\li^\q) — {pc)p.q \ > V2{2t + 25) is upper- 
bounded by |A|(|A| + 1)77. If this is not the case, i.e., if no 
such indices exist, then 



Ejv(pl^^k) 



up,q 



To bound the Lipschitz constants, we compute gradients: the 
result for the real part is 



|V3?(p|^^|g)|r = 



E 



(x^ 



Thus, 
Prob 



p,q 



< 2\A\'^{2t+26f 



P3 
^2 



y'qj 



ypi) 



pj/ 



< 



4||?M|2 ifp = g, 



{ik"" Pc\\l < S\A\\t + 5)2} > 1 - 1^1(1^1 + 1)77. 

This proves the claim. □ 
As stated in the introduction, we now give a proof that it is 
always possible to shift the energy offset such that the energy 
E in question becomes (close to) the harmonic mean energy. 
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Lemma 20 (Harmonic mean and energy shifts). Suppose we 
are given energy levels {Ek}^^i and an energy E between 
the smallest and the arithmetic mean energy Ea, that is 



with density proportional to cxp(— na;^/2). Then, normalize 



1 " 

71 ^ ^ 



mini^fc < E < Ea 
k n 

k=l 

Then there exists AE G M such that the harmonic mean of the 
energies {Ek + ASj^J^j^ equals E + AE, and all energies 
are non-negative: Ek + AE > 0. Moreover, AE is unique 
unless all energies are equal. 

Proof. Denote the harmonic mean of n energy values 
Eu...,Enhy EH{Ekn=^ ELi i:) similarly, 

we use EA{Ek}k^i :~ -^^22=1^^ to emphasize the de- 
pendence of the arithmetic mean Ea on the energy values. 
Let E{AE) := EnlEk + AE}'^^^ - AE which defines a 
continuous function. We may assume without restriction that 

El = miiii Ei and En = max, Ei. If AE = —Ei, then 

E{AE) = EH{Ek-Ei}l^i+Ei = Eh{0, . . .}+Ei = E^. 

It remains to show that limA£;->oo E{AE) = Ea, then 
by continuity there must be some AE > —Ei such that 
E{AE) — E. The limit identity we would like to show is 
equivalent to 

lim {EnlEk + AEy^^^ - EaIE^ + AEy^^^) = 0. 

AS— >oo 



We apply an inequality given by Furuta 114 lH : 

EA{Ek + AE}l^i > EniEk + AEj^^^^ 
4{Ei + AE){Er, + AE) 



> 



{El +En+2AEY 

V 

^1 for A£;->oo 



EA{Ek + AE}l^^. 



This proves existence of some AE such that E{AE) = E. 
In order to see uniqueness, note that the Cauchy-Schwarz 
inequality for two vectors v = and a = 

(oi, . . . , a„)-^, i.e. |(w,a)p < {v,v){a,a) implies that 

«fe) < ^ Y^k=i we have equality if and 

only if ai = a2 = . . . = a„. The derivative of the function E 
turns out to be 



E\x) = 



^ng(ii;, + .)^)/(ngi?,+x) 



- 1 



which must thus be strictly positive unless all energies are 
equal. □ 



the resulting vector: For r 
point 

iXi,X2 



{XI 



+ X2)5,the 



is uniformly distributed on the unit hypersphere. 

If the uniform distribution on the hypersphere shall be sam- 
pled only approximately, then the normalization is not neces- 
sary: we have the norm expectation value E||Xjp = 1 and the 
variance Var||X||^ = ^, such that the distribution of the vec- 
tors X themselves closely resembles the uniform distribution 
on the sphere in high dimensions n. This way, expectation val- 
ues of functions / : M" — > R with respect to the uniform dis- 
tribution on the sphere can be estimated numerically to good 
accuracy (assuming that / is slowly varying and not growing 
too fast at infinity). This has the quantum interpretation (if n 
is even) of drawing random pure states in C"/^. 

It turns out that a simple modification of this algorithm 
yields approximate sampling of the energy manifold Me, or 
rather of its measure pm,^ = Prob that we use in this paper 
We describe the algorithm below. In contrast to the rest of this 
paper, we do not give explicit error bounds in this case, be- 
cause the necessary calculations are straightforward but very 
lengthy, and the resulting error bounds depend sensitively on 
the assumptions on the regularity of the functions / that are 
considered. However, we discuss a rough estimate of the error 
at the end of this section. 

Algorithm 21 (Approximate sampling of Me)- Suppose we 
are given an observable H = on C" with eigenvalues 
{^fe}fc=i and an energy value E such that Theorem[T] applies 
and proves sufficient concentration of measure. Then, the uni- 
form (Hausdorff) measure on the manifold of quantum states 
with {'il)\H\ilj) — E can be numerically sampled in the 
following way: 

1. Find an energy shift s G M such that H' := H + si > 
0, and such that the harmonic mean of the new energy 
levels E'j^ := Ek + s equals E' := E + s, i.e.. 



E'h 



1 " 1 



E[ 
1 



E'. 



2. Generate all real and imaginary parts KV"*: and ^ipk of 
(in the eigenbasis of H) independently according to 
the Gaussian distribution proportional to 



Ei 



exp ^ -n^x^ 



VI. APPROXIMATE SAMPLING OF THE MANIFOLD 

There is a well-known method ll42i l43ll to pick random 
points from the surface of a hypersphere S*""^ C K": Gen- 
erate Xi,X2, ■ . ■ , Xn random real numbers, distributed in- 
dependently identically according to the normal distribution 



If H = \ and E = 1 (which means that we have a void 
constraint), this algorithm reduces to the well-known sphere 
point picking algorithm as a special case (note that the real 
dimension is 2n, which cancels a factor ^ in the exponent). 
If H is not proportional to the identity, then the entries of the 
random vector \^) are independently, but not identically dis- 
tributed. Note that a similar "Gaussian approximation" has 
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been used in Ref. 112 111 right from the start in the analysis of 
the mean energy ensemble (without error bounds). 

Without using the results in this paper, direct calculation 
shows that the distribution generated by the algorithm above 
satisfies 



a 



\W = 



= 1 



H 



(explaining the choice of the energy shift) and 'K{il>\H' - 

E' . The corresponding variances are \si{ij)\H'\'ij)) = 
and 



1 " 



fc=i 



this expression is also present in Theorem[T] where it is called 
E''^ / (uE'q) and assumed to be small (the factor n is absorbed 
into £ there). 

Thus, the algorithm above produces points close to the en- 
ergy manifold Me with high probability. But does it approx- 
imate the uniform distribution on A/^? Since physics mainly 
involves computing expectation values of observables, we are 
interested in a weak form of approximation where we say that 
two measure /i and i/ on C" ~ R^" (or on submanifolds) 
are close, i.e., fi Ri f, if E^f « E^/ for all real functions 
/ : C" M that satisfy certain regularity conditions (such as 
Lipschitz continuity and polynomial growth at infinity). 

For example, the uniform measure on the sphere /^s2n-i 
and in the ball ^b^^ are close if n is large: Since 

AiB2„ {|^) I IIV^II < 1 - £} = (1 - £)2" < exp{-2ne), 

most of the points in the ball are close to the surface. As 
a consequence, a simple calculation shows that expectation 
values of A-Lipschitz functions / : B^" M. satisfy 



|EB2n/-Es2„-l/| < 



A 



2n+l 



Are the uniform measure /ia/e and the resulting Gaussian 
measure from Algorithm |2T| close in this sense? The answer 
is yes, and the results in this paper give a simple geometric 
explanation for this fact, which is schematically depicted in 
Figure [H 

a) As explained at the beginning of this section, it is well- 
known how to pick random points approximately from 
the uniform distribution on the sphere 5*^""^: choose 
real and imaginary parts randomly, distributed indepen- 
dently identically according to a Gaussian distribution 
proportional to exp(— nx^). 

b) We have just seen that the uniform distribution in the 
ball B^" and on the sphere are close to each 
other. Hence the algorithm from a) also samples the 
uniform distribution in the ball to good approximation. 

c) Let N' be the full ellipsoid with equatorial radii 




FIG. 1: Geometric caricature why Algorithm 12 1 1 samples the mea- 
sure hai^ on the energy manifold to good approximation. Point a) 
denotes the well-known sphere point picking algorithm, and e) is for 
Algorithm |2T] See the text below for an explanation how the one 
leads to the other. 



in the directions of the eigenvectors of H', i.e., 

iV':={zGC I {z\H'\z) <E'}. 

Then the ball B^" and the ellipsoid A^' are related by 
a linear transformation L : C" C" which pre- 
serves the normalized geometric volume measure: L 
diag(ai, . . . , a„), then N' = Li?^^") and iib'^" = 

L* iflN')- 

Sampling the ball i?^", and then applying the linear 
transformation L, is the same as sampling the full el- 
lipsoid N'. Writing y = Lx, the components ijk of 
vectors after the transformation are related to the com- 
ponents xk before by yk = {E' / E'/^)^ Xk- Hence 



exp (— na;|) = exp 



exp 



E^ 

' E' 




Thus, we have shown that Algorithm [2T| samples the 
full ellipsoid A^' to good approximation. 

d) The full ellipsoid N' is close the the full ellipsoid A^ 
from Theorem|2] There, it was shown that the uniform 
volume measure in A^ is close to the uniform measure 

on the energy manifold Me- Hence the uniform 
volume measure in A^' is close to ^ime ■ 

e) In summary, the measure produced by Algorithm 121] is 
close to iime as claimed - assuming that the underly- 
ing Hamiltonian is in the range of applicability of The- 
orem[T]and|2] 

The sampling algorithm gives a simple method for a numer- 
ical check of identities such as the form of the reduced density 
matrix in Example |4] It is interesting to note that even though 
the typical reduced density matrices are not Gibbs states (cf. 
Theorem |5]l, the distribution involved in Algorithm |2T] in- 
volves the Boltzmann-like term exp(— c£^^), where c > is 
constant, and E'^ denotes the fc-th energy level. 
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How good is the approximation given by Algorithm |2TI ? 
As explained above, the algorithm is meant to approximate 
expectation values of functions / : R" — > K on the energy 
manifold with respect to the measure ■ Thus, we would 
like to estimate the expression |E/ — Ej\/^ /|, where E denotes 
the expectation value with respect to the Gaussian measure 
used in the algorithm. As a lower bound on that error (for 
some /), recall that 

Var|l^f = E(|l^f -1)' = -^. 

The function /(■)/') := (IIV'IP ~ l)^ is Lipschitz continuous, 
and the upper bound on the Lipschitz constant ||V/(V')|| ~ 
4||Vj||(1 - IIV^IP) < 8/(3^3) on the unit ball does not grow 
with n. Assume for simplicity that E' and E'q are constant in 
n (like in Example [3] and Example H]). Then 

|E/-Em,/| = ^--, 
E'q n 

which shows that we have to expect at least an error of the or- 
der 1/n even for functions / and Hamiltonians H that behave 
very regularly. 

A rough upper bound on the error can be given by adding 
the error contributions of steps a), b), c), and d) in Algo- 
rithm |2T] It seems that the dominant contribution comes from 
step d) - a corresponding error estimate is given in Theorem|2] 
It is roughly of the order again assuming that the ener- 

gies and the Lipschitz constant are constant in n. 



VII. CONCLUSIONS 

In this work, we have established the notion of concentra- 
tion of measure for quantum states with a fixed expectation 
value. The results that we established constitute on the one 
hand a new proof tool to assess properties of quantum states 
with the probabilistic method. Such a proof tool is expected 
to be helpful in a number of contexts, e.g., when sharpening 
counterexamples to additivity by enforcing a strong "conspir- 
acy" by means of a suitable Hilbert Schmidt constraint, adding 
to the portfolio of techniques available related to the idea of a 
probabilistic method. 

On the other hand, in this work we are in the position to in- 
troduce concentration of measure ideas to notions from quan- 
tum statistical mechanics, specifically to the mean energy en- 
semble, and link this physically meaningful ensemble to ideas 
of typicality. Obviously, a constraint of the type introduced 
here could as well relate to settings where the particle num- 
ber is held constant, so is expected to be applicable to a quite 
wide range of physical settings. It is also the hope that meth- 
ods similar to the ones established here also help assessing 
questions of typicality in the context of quantum dynamics 
and addressing key open problems in the theory of relaxation 
ifisl - lisll of non-equilibrium complex quantum systems. 
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